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ABSTRACT 



The distribution of a measure of distance between a probability 
density function and its estimate is examined through Monte Carlo 
methods • The estimate of the density function is that proposed by 
Rosenblatt using sums of weight functions centered at the observed 
values of the random variables. The weight function used in all 
cases was triangular, but both uniform and Cauchy densities were 
tried for different sample sizes and bandwidths. The simulated dis- 
tributions appear in all cases to be close to Gamma distributions, 
but it has not been possible to relate the parameters to the popu- 
lation characteristics or to the window shape and bandwidth. 
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I. INTRODUCTION 



There are several recently proposed classes of empirical proba- 
bility density function [Refs. 1-5] all generally considered to be 
superior to the classical histogram estimates* The class considered 
in this paper is based on independent observations, i.e., X^, X 2 ,... 

are independent and identically distributed random variables with 
continuous unknown density function f (x) . The method used to esti- 
mate f(x) is that proposed by Rosenblatt [4] and is as follows. 
Denote the estimator of f (x) by f n (x). Then 



f n (x) 



1 

nb(n) 




9 



where W(u) is a bounded integrable weight function with 



00 



/ 

— CO 



W (u) 



du 



1 



and b(n) is a bandwidth that tends to zero as n 00 , but is such 

that o(b(n)) = n i.e. it converges at a rate slower than 1/n. 

1/2 

Thus we might have b(n) ~ 1/n . Note that all estimates of this 

form for given observations are themselves density functions; that 
is 



f n (x) > 0 , 




f (x) dx = 1 
n 



\ 
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Since the X^'s are random variables, f^Cx) i s a continuous parameter 
stochastic process. 

The local properties of such estimates satisfying relatively mild 
conditions can be shown to be biased [3] . Our object in this thesis 
is to investigate a global measure of how good f R (x) is as an esti- 
mate of f (x) . This measure was proposed by Bickel and Rosenblatt [2]. 
The particular global measure suggested for investigation is the 
statistic: 



whose value will vary with each realization of X, , X , i.e. it 

I n 

is a statistic or function of the n random variables. 

Since an exact distribution of this statistic is not mathematically 
tractable, we have examined some representative cases by simulation. 

The conjecture made by Bickel and Rosenblatt [2] is that the 
statistic B(n) is asymptotically normally distributed regardless of 
the underlying distribution f (x) . With this conjecture in mind, we 
hoped to establish a criterion that can be used to perform goodness 
of fit testing. 

A problem with this type of criterion is that, unlike the Kolmo- 
gorov-Smirnov or Cramer-von-Mises statistics for goodness of fit of 
an empirical distribution function to the unknown distribution, B(n) 
is not distribution free and its distribution for finite n is unknown 
for any case. However, it is believed that B(n) is fairly robust 
with rapid convergence to the asymptotic distribution; another object 
of the thesis is to examine this hypothesis. 



[f h (x) - f (x)] 



2 




f (x) 



dx , 
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Going beyond asymptotic normality of $(n), the following result 
for a quadratic functional is also of some interest [2] . The func- 
tion a(x) is assumed to be a bounded piece-wise smooth integrable 

-2/9 

function. If b(n) = 0(n ) as n -*■ 00 , 



{b(n)}” 1 // 2 [nb(n) J’ [f n (x)- f (x) ] 2 a(x)dx -J‘nx)a(x)dxJ‘\i}[z) 2 dz] 
is asymptotically normally distributed with mean zero and variance 



2W (4) (0) f a(x) 2 f (x) 2 dx 



(4) 

as n 00 , where W is the 4-th convolution of W with itself* 
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II. SIMULATION 



The primary object of the simulation is to investigate the dis- 
tribution of the statistic $(n): 



0(n) 




[f n (x) - f (x)] 2 

fW 



dx , 



over a suitable range of integration. We have broken the simulation 
into the following cases: 



A. UNIFORM (0,1) CASE 
Since 



f (x) 



1 , 



0 , 



0 < x < 1 



otherwise 



$ (n) becomes 



3(n) 



l-b(n) 

■/ 

b(n) 



[f n (x)-l] 2 dx . 



We integrate from b(n) to l-b(n) instead of from 0 to 1, simply 

because we want to avoid the marked bias near 0 and 1. It can easily 

be shown that f (x) is biased in the end regions, but not otherwise, 
n 
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Thus the mean and variance of f (x) in this case is; 



E[f n (x)] 



This is only true for b(n) £ x £ l-b(n). Also 




The integration was performed in the simulation using Simpson's 
rule, dividing the interval of integration into 100 equally spaced 
sub-intervals. The results were found to be satisfactory in the 
sense that the value of the integral for 100 sub-intervals differs 



o 

i f 

(n) J 



dy 



x+b(n) 



(1- %?!■) ay 



b (n) 



x-b(n) 

x+b (n) 



*■ x-b(n) x- 

xT 

(n) L 



(x-y) 
b (n) 



x+b(n) 



(y-x) 
b (n) 



2b (n) - 



ay-/ 



1 (b(n)) 2 1 _ 

>(n) 2 J 



little from those with more sub-intervals 



To evaluate f (x) , use the following formula: 
n 

where the weight function W(u) is triangular, i.e.: 

! l-|u| , if | u | £ 1 

' 

0 , otherwise 

Xy j=l, ..., n are independent and identically distributed random 
variables. In this first case, they are uniform (0,1) random variables. 
Also b(n) is the bandwidth. We employed three bandwidths in the 
uniform (0,1) case, namely: 3/Vn, l//n, 1/n, one at a time. For each 
bandwidth, we used five different sample sizes, i.e. 100, 200, 500, 

1000, 1500, and for each sample size, we used five different random 
number seeds, i.e. simulated five times. The reason for this is to 
have the ability to assess the variability of the simulation results 
from the sections. The random number seeds are taken from the imme- 
diate previous "1500 sample size" output, except the first random 
number seed which is arbitrary. So, there are five sets of random 
numbers which are entirely different from one another. The following 
table should help to clarify the scheme: 



\ 
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TABLE I 



Bandwidth 


Sample size, n 


No* replications 


# Sections 




100 


2000 


5 


3 


200 


2000 


5 


n 1/2 


500 


2000 


5 




1000 


2000 


5 




1500 


2000 


5 




100 


2000 


5 


i 


200 


2000 ! 


5 


n 1/2 


500 


2000 


5 




1000 


2000 


5 




1500 


2000 


; 5 




100 


2000 


5 


1 


200 


2000 


5 


n 


500 


2000 


5 




1000 


2000 


5 




1500 


2000 


5 



For each sample size, n, repeat five times using different random 
number seeds . 

For the uniform (0,1) case, there are 75 outputs altogether. In 
each single simulation (one computer output), we evaluated 400 $(n)'s, 
say Bj (n) , j=l, ..., 400. The output includes the following: 

1. A bar histogram of the 400 3(n) 's. 

2. SNAP/IEDA graphs: 

a. A refined solid histogram. 

b. Graph of empirical log-survivor function, £n (1-j/ (N+l) ) vs 
B^j (n) , where B^j (n) is an ordered value. 
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c. Empirical CDF graph i/(n+l) vs B^j(n), where B^j (n) 
denotes the ordered observed Bj(n)'s. 

d. Normal probability distribution with estimated mean and 

variance. Norm vs e.c.d.f . of Bj (n) , where Norm is computed as 
follows : Let 



E B. (n) 
0 ? — 



and S ( 3 (n) ) = sample variance of the 



B.(n) 



Then 




e. Normal probability plot Norm vs i/(N+l). 

A sample output is included in the "Computer Output" section 
(page 30 - 35) . 

For each bandwidth and each sample size, there are five computer 
outputs, or 5 x 400 = 2000 Bj(n)'s. With this 2000 sample size, we 
constructed a histogram again. Also we used the Rosenblatt smooth- 
ing technique to get a smooth density function estimate for B(n). 
After a number of trials, we found that the bandwidth needed in the 
Rosenblatt estimator has very much to do with the smoothing and how 
"good" the density looks. Fortunately it is robust to a certain 
extent. This graph can give us a better picture of the true distri- 
bution than the classical histogram. The last graph of this complete 
output for sample size 2000 is the empirical log survival function; 



£n(l - i/(N+l)) 



vs 




(n) • 



\ 

\ 
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There are 15 "2000 sample size" computer outputs in the uniform (0,1) 
case. (Since each bandwidth has five "2000 sample size" outputs.) 

The complete computer outputs for each bandwidth and each sample 
size in the uniform (0,1) case are included in the "Computer Output" 
section (page 42 - 86) . 



B. THE CAUCHY DISTRIBUTION 

The next case investigated is the Cauchy distribution, again with 
a triangular weight function and with the range of integration (-3,3): 



3(n) 




(f n (x) - f(x)] 2 
f00 



dx . 



The interval of integration comprises 80% of the probability mass for 
this distribution. The integration is carried out, in this case, 
using Simpson's rule with 600 sub-intervals. This grid was used after 
examining the 100, 300, 600, 900 sub-interval cases to ensure that a 
fine enough grid is used in the numerical integration. 

The Cauchy density function is 



f(x) = — 

ir(l + x ) 

The density function estimator is the same as in the uniform case: 
f n (X> = nb (n) 53 W { b(n) } * 

j=l 

The weight function W(u) is again triangular. Thus the only differ- 
ence in this formula from the previous case is that X., j=l, . n 
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are i.i.d. Cauchy random variables. Note also that for finite n the 

estimator will have a bias component, this bias component usually 

decreasing, for given n,as bandwidth decreases. In that case though 

the (pointwise) variance of f^fx) increases and possibly also the 

variance of (3(n). Note that 8(n) and its distribution is a function 

of both sample size and bandwidth. 

The bandwidth b(n)'s used here are 1/Jn, 3/v^, 20/y/n , the last 

one representing a case in which bias in the estimate f (x) plays a 

n 

major role in the distribution of 8(n). 

The rest of the simulation scheme is exactly the same as for the 
uniform (0,1) case. 

A sample output for the 400 8j (n) case is included in the "Com- 
puter Output" section (page 36 - 41). Also the complete computer 
outputs (2000 replications) for each bandwidth and each sample size 
in the Cauchy case are included in the "Computer Output" section 
(page 87 -131) . 

C. HAND PLOT GRAPHS 

Besides the computer work, there are six hand plot graphs. These 
are the 8(n) quantiles versus the standard normal quantiles. They 
are included in the Appendix, to illustrate the rate of convergence 
of the distribution of 8(n) to its asymptotic normal distribution. 
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III. RESULTS 



The asymptotic result, conjectured by Rosenblatt, is that by 
using a triangular weight function when estimating the uniform 



density function on (0,1) 

. ^l-b(n) 

- 1/2 ( 

{b(n) } 



jnb(n) / 

^ ^ b (n) 



|f (x)-l| 2 dx-(l-2b(n)) / W(u) 2 du 



j W(u) 2 du| 



is asymptotically normally distributed with mean zero and variance 



2W (4) (0) (l-2b(n) ) 



-2/9 

as n ® if nb(n) 00 and b(n) = o(n ) 0 Here n is the sample 
size and b(n) the bandwidth. For the triangular weight function 



o 

/ 



W (u) 2 du = ~ 



(4) 

and W (0) the 4th convolution of W with itself at zero, is 302/630. 
From the above expressions, we get: 
l-b(n) 



E (@ (n) ) 



-[/ 

b(n) 



I f n (x) — 1 ] dx 



•]•» 



(l-2b(n) ) 
nb(n) 



l-b(n) 



Var (3(n) ) = Var 



f f |f n (x)-l| 2 J - 2» (4> (0)(l-2b(n)) . 

L J n -I n b(n) 

b(n) 



A comparison of the simulated values with that of the conjectured 
is tabulated below for the different cases investigated when f (x) is 
a uniform density: 
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TABLE II 



Comparison of estimated mean values and 
asymptotic mean values of 8(n) for dif- 
ferent bandwidths and sample sizes. 









E (8 (n) ) / (l-2b (n) ) 


n 


b(n)=3//n 


E(8(n) ) 


Conjectured 


Computer output 


100 


.3000 


.0089 


.0222 


.0127 


200 


.2121 


.0090 


.0157 


.0109 


500 


.1342 


.0073 


.0099 


.0075 


1000 


.0949 


.0057 


.0070 


.0058 


1500 


.0775 


.0048 


.0057 


.0051 








E(8(n) )/ (l-2b(n) ) 


n 


b(n)=l/»4i 


E (8(n) ) 


Conjectured 


Computer output 


100 


,1000 


.0533 


.0667 


.0583 


200 


.0707 


.0405 


.0471 


.0415 


500 


.0447 


.0271 


.0298 


.0269 


1000 


.0316 


.0197 


.0211 


.0197 


1500 


.0258 


.0163 


.0172 


.0168 



TABLE III 



Comparison of estimated standard deviation 
values and asymptotic standard deviation 
values of 8(n) for different bandwidths and 
sample sizes. 





b(n) =3//n 




a(8(n))/(l-2b(n)) 


n 


a(8(n)) 


Conjectured 


Computer output 


100 


.3000 


.0113 


.0283 ! 


.0115 


200 


.2121 


.0081 


.0141 


.0088 


500 


.1342 


.0046 


.0063 


.0047 


1000 


.0949 


.0029 


.0036 


.0030 


1500 


.0775 


.0022 


.0026 


.0023 




b(n)=l/» / ri 




a (8 (n) ) / (l-2b (n) ) 


n 


a(8(n)) 


Conjectured 


Computer output 


100 


.1000 


.0277 


.0346 


.0315 


200 


.0707 


.0171 


.0199 


.0189 


500 


.0447 


.0088 


.0097 


.0092 


1000 


.0316 


.0053 


.0057 


.0056 ( 


1500 


.0258 


.0040 


.0042 


.0043 
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Especially for small bandwidth the agreement between the asymptotic 
and simulated variances is very good even for small n(n=100). The 
same is true for the expected value, although convergence is slower 
than for O and again slower for large bandwidth. 

The simulation reveals that the distribution of the 0(n) statistic 
converges to normality very slowly (cf. the six quantile plots in the 
Appendix). For sample sizes 100, 200, 500, 1000, 1500, the histograms 
are all skewed to the right; the empirical log-survivor function plots 
are concave downward; and none of the normal probability plots is 
approximately straight lines (refer to "Computer Output" section) . 

It is very suggestive that the f3(n) statistic is approximately Gamma 
(0,k) distributed where the density of the Gamma distribution is: 

-kx/0 k-1 

f = (k/0) e _ x 
1 } (k-1) i 

and the mean and variance are: 

E (X) = 6 ; Var(X) = k0 2 . 

Therefore, for each underlying distribution, each bandwidth, and 
each sample size, we used the 2000 evaluated B(n)'s to estimate the 
0's and k's, and tabulate them as follows: 
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TABLE IV 



( 1 ) 



( 2 ) 



( 3 ) 



(4) 



where 



Estimated parameters for fitted Gamma distribution for 
3(n) in the case where the densities are uniform (0,1) 
and Cauchy, with a triangular weight function. 



Bandwidth = 1/n Underlying distribution: Uniform (0,1) 

Sample sizes and estimated 0's and k's 





100 


200 


500 


1000 


1500 (std. dev.) 


0 


.6540 


.6638 


.6612 


.6621 


.6645 ± .0045 


*** 

k 


39.48 


38.46 


32.95 


30.86 


31.06 ± 2.91 



Bandwidth = 1/Jn Underlying distribution: Uniform (0,1) 

Sample sizes and estimated 0's and k's 





100 


200 


500 


1000 


1500 (std. dev.) 


3 


.0565 


.0420 


.0279 


.0201 


.0166 ± .0003 


k 


3.62 


5.56 


8.63 


12.61 


16.92 ± 1.45 



Underlying distribution: Cauchy 



Sample sizes and estimated 0's and k's 





100 


200 


500 


1000 


1500 (std. dev.) 


e 


.3921 


.2800 


.1764 


.1258 


.1029 ± 


.0004 


k 


21.02 


31.49 


59.34 


79.19 


100.83 ± 


4.37 


Bandwidth = 2/Jri 


Underlying distribution: 


Uniform (0,1) 


Sample sizes and 


estimated 0's and k f s 






100 


200 


500 


1000 


1500 (std. dev.) 


e 


.0123 


.0107 


.0079 


.0060 


.0051 ± 


.0001 


k 


1.11 


1.59 


2.43 


3.65 


5.03 ± 


0.37 








Underlying distribution: 


Cauchy 


Sample sizes and 


estimated 0’s and k's 






100 


200 


500 


1000 


1500 (std. dev.) 


e 


.1247 


.0911 


.0576 


.0412 


.0340 ± 


.0004 


k 


8.62 


12.30 


20.32 


29.14 


33.98 ± 


1.55 



Bandwidth = 20/Vri Underlying distribution: Cauchy 

Sample sizes and estimated 0's and k's 





100 (std. dev.) 


200 


500 


1000 


1500 (std. dev.) 


0 


.0558 ± .0008 


.0262 


.0110 


.0066 


.0052 ± .0001 


** 

k 


6.71 ± 0.78 


4.03 


3.21 


4.10 


5.24 ± 0.16 



= mean , 

~ 2 

k s 1/ (estimated coefficient of variation) 
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The standard deviation estimates for the parameter estimates are 
obtained from the five identical simulations using different random 
number seeds, each generating a sample size of 400, as detailed in 



Table I 



IV. CONCLUSIONS 



Although the conjecture proposed by Rosenblatt is that the dis- 
tribution of the statistic £(n) is asymptotically normal, this is 
certainly not the case for moderate sample sizes such as those used 
in the simulations. Based on this fact, the conjecture has little 
practical value, but the fact that all cases examined give rise to 
an approximate Gamma distribution gives some hope for stability in 
the results. 

However, the method proposed by Rosenblatt for estimating density 
functions is no doubt plausible as can be seen from the Rosenblatt 
type estimates used in this study. More effort in this area is 
needed, particularly to relate the Gamma distribution parameters to 
underlying factors. Since we realize that for moderate sample sizes, 
the statistic 3(n) is not distribution free, and the bandwith b(n) 

(a function of sample size) plays a crucial role in estimating the 
density function and also the shape of the weight function W(u) is 
a major factor for evaluating f^fx), the future work may be to find 
the best combination of the above three factors by simulating the 
distribution of the statistic £(n) for different underlying distri- 
butions, bandwidths, sample sizes and weight functions and tabulate 
the resulting quantiles. Then we may be able to put the statistic 
3(n) to actual use. 
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APPENDIX 



Included in this Appendix are six plots of sample size versus 
estimated quantiles of the statistic $( n) , normalized by subtract- 
ing the estimated mean and dividing by the estimated standard 
deviation. Also plotted as straight lines are the corresponding 
quantiles of the unit normal distribution. 

The 17 quantiles, extracted from each 2000-$ (n) sample, are 
0.010, 0.020, 0.025, 0.05, 0.10, 0.20, 0.30, 0.40, 0.50, 0.60, 0.70, 
0.80, 0.90, 0.95, 0.975, 0.980, 0.990. 

Other data for each plot are self-explanatory. 



\ 
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Estimated g(n) quantiles & N(0,1) quantiles 




A. 3, 



Eatiaatad $(n) quantiles ft N(0,1) quantllaa 
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Estimated 0(n) quantiles £ N(0,1) quantiles 




Estimated 0(n) quantiles fi N(0,1) quantiles 




< 2 . 5 * 



i \ 



Estimated 8 (n) quantiles & N(0,1) quantiles 




Estimated 3(n) quantiles & N(0,1) quantiles 







COMPUTER OUTPUT 



There are altogether 102 sheets of graphs . All of them are self- 
explanatory, except that the letter "X” in the graphs represents the 
statistic £(n). 

The following is a list of index of the graphs. 



Page Description 

30 - 35 sample computer output for the case: 

underlying distribution-uniform (0,1) ; 
underlying distribution sample size n=1500; 
bandwidth 3/^/n; 

statistic £(n) sample size m=400. 

36 - 41 sample computer output for the case: 

underlying distribution-Cauchy ; 
underlying distribution sample size n=100; 
bandwidth l//n; 

statistic £(n) sample size m=400. 



The rest of 


the computer graphs in 


this section are 


all for statistic 


B(n) sample 


size m=2000. The page 


index is as follows. 


Page 


Underlying 

distribution 


Underlying 
distribution 
sample size 


Bandwidth 


42 - 44 


U(0,1) 


100 


3/Sn 


45 - 47 


0(0,1) 


200 


3/Sn 


48 - 50 


U(0,1) 


500 


3/v£ 


51 - 53 


U(0,1) 


1000 


3/v£ 


54 - 56 


U(0,1) 


1500 


3/i^n 


57 - 59 


U(0,1) 


100 


l//n 


60 - 62 


U(0,1) 


200 


1/Jn 



1 \ 
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Page 


Underlying 

distribution 


Underlying 
distribution 
sample size 


Bandwidth 


63 - 65 


U(0,1) 


500 


1/i/n 


66 - 68 


U(0,1) 


1000 


l/v£ 


69 - 71 


U<0,1). 


1500 


l/v£ 


72 - 74 


U<0,1) 


100 


1/n 


75 - 77 


U<0,1) 


200 


1/n 


78 - 80 


U(0,1) 


500 


1/n 


81 - 83 


U<0,1) 


1000 


1/n 


84 - 86 


U<0,1) 


1500 


1/n 


87 - 89 


Cauchy 


100 


1/Jn 


90-92 


Cauchy 


200 


l//n 


93 - 95 


Cauchy 


500 


l//n 


96 - 98 


Cauchy 


1000 


1/y/n 


99 - 101 


Cauchy 


1500 


l//n 


102- 104 


Cauchy 


100 


3/v£ 


105- 107 


Cauchy 


200 


2/Jn. 


108- 110 


Cauchy 


500 


3/Jn. 


111- 113 


Cauchy 


1000 


3/Jri 


114- 116 


Cauchy 


1500 


3//ri 


117- 119 


Cauchy 


100 


20//n 


120- 122 


Cauchy 


200 


20/Jri 


123- 125 


Cauchy 


500 


20/v^n 


126- 128 


Cauchy 


1000 


20/i/n 


129- 131 


Cauchy 


1500 


20/ t/H 



1 
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INTEGRAL SQLARE NORM RANDOM VARIABLE 
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IMl-l/Ml) VS X EWFIPICAL LOG SURVIVAL FUNCTION 
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INTEGRAL SCL/FE KCE* SAMLc SIZE M - 2 CC C 
UNlFCHf V^BlflELE !i4lE SIZE M = 200 
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ESTIMATED CtNSin FLKUCA VS INTEGRAL SCU4PE NCRM PANCCM VARIABLE 

INTEGRAL SCLAPS N r H 44 SAMPLE SIZE = 200C TRIANGULAR VilNDOW BANCMDTh ■ RANGE 

ESTf«-4TEC CEASTTV FLKTICK »S EVALUATED 47 ICC EQUALLY SPACED PflMTS 
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LK(H/KM! VS ) EMPIRICAL LCG SLRVIVAL FUNCTION 



FPECLEKCIES SAMPLE SIZE - 2000 

63 161 278 274 234 2C* 175 135 III 63 71 37 37 26 21 16 l 
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ESTIMATED CENSMY FUNCTICN VS INTEGRAL SCLARF NCRF F AND CM VARIABLE 

INTECRAL SCL/FE N C F M SAFFLE SIZE = 20 JO TK I ANGULAR RINDQW BANDUIOTH ■ RANGE 

ESTIMATED CENSI1Y FLNCTICN IS EVALUATED AT 100 EQUALLY SPACED PUINT$ 
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INTECRAL SCU A R E NfFK SAFFLE SIZE M - 2CCC 
LMFCRR QAKCCF VAMAELf SAHfLE SIZE H * I5CC 
TRIAAGLLAR hlAOth. E AFCVi I CTF » 3/SCFTCM 
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ESTIKiTIC TEKSny FLNCTICN VS INTEGRAL SQUARE NCRM RANDOM VARIABLE 

INTfcCRAL SCLAPE NCFK S A N F l E SIZE - 2000 TRIANGULAR UNDO* BAN CW IDT h - RANGE 

ESTIMATED CENSITY FUNCMCIt IS EVALUATED AT ICC EQUALLY SPACED POINTS 
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INTfGFAL SCU/st*: MiFM SAMPLE SIZE f« =• 2 ‘JCC 
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LMl-I/N+n VS a EMPIRICAL ICG SURVIVAL FUNCTION 
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INTEGRAL SQUARE NCPM SAMPLE SIZE M * 2000 
IJNlMflN RANC 0 M VARIABLE SAMPLE SIZE N * 200 
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INIi-I/U+ll S > EMPIRICAL LCG SURVIVAL FUNCTION 
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ESTIMATE) cr\SMY FUfjrT|t'N VS I N T EGRAL SQLARE NCR* RANDOM VARIABLE 

INTEGRAL SQUARE NORM sample size <= 20 JO TR I ANGULAR WINDOW BAN0WI0TH - RANGE 
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fcMPlRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SIZE • 2000 
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INTF.GRAL SQUARE NCRM SAPFLE SIZE M - 2000 
UNIFORM RANOOM VARIABLE SAMPLE SIZE N * 1000 
TRIANGULAR NINOO*. EANOWICTH - I/SQRT(N» 
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ESTIMATEO CENSITY FUNCTION VS INTEGRAL SQUARE NCRH RANDOM VARIA8LE 

INTEGRAL SCUAPE NCPM SAMPLE SIZE = 2000 TRIANGULAR WINOOW 8AN0WI0TH - RANGE 

ESTIMATEO DENSITY FUNCTICN IS EVALUATED AT 100 EQUALLY SPACED POINTS 
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EMPIRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SUE - 2000 
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UNIFORM RANOOM VARIABLE SAMPLE SUE N * LSOO 
TRIANGULAR WlNOCw. BANOWIDTI* * l/S0KT(Ni 
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ESTIMATED density function vs integral square norm random variable 

INTEGRAL SQUARE NORM SAMPLE SIZE - 2000 TRIANGULAR WINDOW BANOwIOTh • RANGE 

ESTIMATED DENSITY FUNCTION IS EVALUATED AT IDO EQUALLY SPACEO POINTS 
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INI1-I/N*!) VS X EMPimCAL LCG SURVIVAL FUhCTION 



FREQUENCIES SAMPLE SIZE • 2000 
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ESTIMATED CFNS1TV FUNCTION VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SCUAPF NCR* SAMPLE SIZE * 2000 TRIANGULAR WINDOW BANDWIDTH - RANGE 

ESTIMATED OENSITY FUNCTION IS EVALUATED AT 100 EQUALLY SPACED POINTS 
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EMPIRICAL LOG SURVIVAL FUNCTION 



FBtQUEMClES SAMPLE SUE - 2000 

5 12 31 55 105 Itft 2 C 7 200 222 234 ITS 154 137 93 75 3d 3 
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ESTMAT c n CENSITV FUACTICK Vi INTEGRAL SQUARE UCRM KANO UK VARIABLE 

INTEGRAL SCUA«6 MO*M SAKRI.f SIZT = 2000 tr{ ANGULAR WINDOW BANOWlOTH » RANGE 
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FMEUUFKCtfS SAMPLE StZE - 2000 
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ESTIMATEO OENSITY FLNCTICN VS IMFGRAL SQUARE NCRM RANOOM VARIABLE 

INTEGRAL SCUAFF NCRM SAKPLE SIZE = 2000 TRIANGULAR WINDOW BANOHIOTH « RANGE 

ESTIMATEO CFKSITY FUKCT1CN IS EVALUATED AT 100 EQUALLY SPACED POINTS 
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IN ( l- 1 /f‘+ 1 > VS > EMPIRICAL LOG SURVIVAL FUNCTION 



FRPQUEMCIPS SAMPLE SIZE ■ 2000 
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ESTIMATED DENSITY FUKCTICN VS INTEGRAL SQUARE N3RM RANDOM VARIABLE 

INTEGRAL SCUARF NTRm SAMPLE SIZE = 2000 TRIANGULAR WINDOW BANDWIDTH - RANGE / 10 

ESTIMATED DENSITY FUNCULN IS EVALUATED AT 100 EOJALLY SPACED POINTS 
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IMI-I/MU VS X I R I C AL LOG SURVIVAL FU**CT I ON 



FREQUENCIES SAMPLE SIZE * 2000 
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ESTIMATED CENSITY FUNCTICN VS INTEGRAL SQUARE NORM RANOOM VARIABLE 

INTEGRAL SQUARE NORM SAMPLE SIZE * 2000 TRIANGULAR W INOOW BANOWIUTH ■ RANGE 

ESTIMATED CENSITY FUNCTICN IS EVALUATED AT 100 EQUALLY SPACEO POINTS 
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EMPIRICAL LOG SURVIVAL EUNC T I ON 



FREQUENCIES SAMPLE SIZE ■ 2000 
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ESTIMATED DENSITY FUM^TICf^ VS INTEGRAL SQUARE NCRM RANDOM VARIABLE 

INTEGRAL SQUARE NCRM SAMPLE SIZE =■ 2QDD TRIANGULAR WINDOW BANDWIDTH * RANGE / 10 

ESTIMATED CEKSI7Y EUNCTICN IS EVALUATED AT 100 EQUALLY SPACEO POINTS 
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LNU-I/Ml) VS > EM PJ PI CAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SIZE • 2000 
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INTEGRAL SCUARE NORM SAMPLE SIZE M * 2000 
CAUCHY RAN00H VARIABLE SAMPLE SIZE N * 500 
TRIANGULAR WINDOW. 8AN0WI0TH - I/SURTtN) 
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LNU-I/Ntl) VS X EMPIRICAL LOG SURVIVAL FUNCTION 
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INTEGRAL SCUAPE NHPM SAMPLE SUE H « 2000 
CAUOV P ANOOM VARIAUL. 5 * SAMPLE SUE N * 1000 
TRIANGULAR MADCH. CAA3-UCTH - l/SQRT(N) 
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ESTIMATED CENSIlY FUNCTICN VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SQUARE NCRH SAMPLE SIZE = 2000 TRIANGULAR WINDOW BANOWIOTH ■ RANGE / 10 

ESTIMATED DENSITY FUNCTICN IS EVALUATED AT 100 EQUALLY SPACED POINTS 






o* 

•*■ 



^ 8 



i 



EMPIRICAL LCG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SIZE * 2000 
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INTEGRAL SQUARE NORM SAMPLE SIZE M - 2000 
CAUCHY RAK0CP VARIABLE SAMFLF SIZE N - 1500 
TRIANCULAR WIK00N. BANGUI CTH > I/SQRTIN) 
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F ST I MATEO OENSITY FUNCTION VS INTEGRAL SQUARE NORM RANOOM VARIABLE 

INTEGRAL SCUARE NORM SAMPLE SIZE * 2000 TRIANGULAR WINDOW BANDWIOTH « RANGE 

ESTIMATED CENSITY FUNCT ICN IS EVALUATED AT 100 EQUALLY SPACED POINTS 
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INTEGRAL SCLAPE NCP# SAFFIE SIZE f* * 2000 
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ESTIMATED CENSITY FUNCTION VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SCUARE NCRM SAMPLE SIZE = 2000 TRIANGULAR WINDOW BANDWIOTH - RANGE 

ESTIMATED DENSITY FUNCTION 15 EVALUATED AT 130 EQUALLY SPACED POINTS 
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LNII-I/N+II VS X EMPIRICAL COG SURVIVAL FUNCTION 



FBFgUENCUS SAMPLE SIZE - 2000 
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C 4 UCFY RAN 03 M VARUdLE SAMPLE SUt N * 200 



**.2i*i J**Si***********iiSSJ*i2i#****+**#*#?i2!JS*° l 



o 

o 






rg 

O 

( 

UJ 

in 

*M 

O' 



* 



*N 

O 



m 

rg 

<0 



m 



<-n w 



UJ UJ 

m *\i 

— • o 

<M rg 



O O 

« « 
x X 
* * 
■ X 



I O 6 



\ 



ESTIMATEO CENSITV FUNCTION VS INTEGRAL SQUARE NORM RANOOM VARIABLE 

INTEGRAL SQUARE NORM SAMPLE SIZE = 2000 TRIANGULAR WINOGW BANOM IOTH - RANGE 

ESTIMATEO CENSITV FUNCTION IS E VALUATEO AT 100 EQUALLY SPACEO POINTS 
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integral square norm sample size M • 2000 

CAUCHY RAN0CM VARIABLE SAMPLE SUE N * 500 
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ESTIMATED CFNSITY FUNCTION VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SCUAR5 NORM SAMPLE SUE = 2000 TRIANGULAR WINDOW BANDWIDTH - RANGE 

ESTIMATED OENSITY FUNCTION IS EVALUATED AT 100 EQUALLY SP4CE0 POINTS 
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INU-I/N4U VS X EMPIRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SIZE - 2000 
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ESTIMATED CENSHY FUNCTILN VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SCLAPE NCRK SAMPLE 5 ! / E = 2JU0 TRIANGULAR WlNOGW BANDWIOTH • RANGE 

C $TIHATED CCNSITY FUNCTICf* IS EVALUATEO AT IOC EQUALLY SPACED POINTS 
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INTEGRAL SCUAPE NCRM SAMPLE SIZE H * 2000 
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ESTIHATEO DENSITY FUNCTICN VS INTEGRAL SQUARE NCRH RANOOM VARIABLE 

INTEGRAL SQUARE NORM SAMPLE SIZE - 2000 TRIANGULAR WINDOW BANOWIDTH * RANGE / 10 

ESTIHATEO DENSITY FUNCTION IS EVALUATEO AT 100 EQUALLY SPACEO POINTS 
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LN( l-I/N*ll VS X EMPIRICAL LOG SURVIVAL FUNCTION 



MfgUCNCKS SAMPlf Silt ■ 2000 
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INTEGRAL SCUARE NCRM SAMPLE SIZE N • 2000 
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ESTIMATED DENSITY FUNCTICN VS INTEGRAL SQUARE NORM RANDOM VARIABLE 

INTEGRAL SCUARE NORM S APPLE SIZE » 2000 TRIANGULAR WINOOW BANOWIDTH ■ RANGE / 10 

ESTIMATEO OENSITY FUNCTICN IS EVALUATED AT 100 EQUALLY SPACEO POINTS 
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LN(L-I/N*L) vs * EMPIRICAL LOG SLRVIVAL FUNCTION 



FREQUENCIES SAMPLE SUE ■ 2000 
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INTEGRAL SCUARE NORM SAMPLE SIZE M * 2000 
CAUCHY RANOOH VARIABLE SAMPLE SIZE N » 200 
TRIANGULAR WINOOW. BAN0WI0TH - 20/SQRTINI 
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EMPIRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SUE * 2000 
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INTEGRAL SQUARE NQRH SAMPLE SIZE H • 2000 
CAUChY RANOQM VARIABLE SAMPLE SIZE N * 500 
TRIANGULAR taINDOW. 0 ANOWI OTH « 20/SQRTCNI 
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ESTIMATED OENSITY FUNCTICN VS INTEGRAL SQUARE NCR M RANDOM VARIABLE 

INTEGRAL SQUARE NORM SAMPLE SIZE = 2000 TRIANGULAR WIN00W BANDWIOTH - RANGE / 10 

ESTIMATED DENSITY FUNCTICN IS EVALUATEO AT 100 EQUALLY SPACED POINTS 
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EMPIRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SUE - 2000 
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INTEGRAL SQUARE NORM SAMPLE SIZE M » 2000 
CAUCHY RANDOM VARIABLE SAMPLE SIZE N » 1000 
TRIANGULAR HINOOW. BANOWIOTH - 20/SCRT(N» 



!•« 

u» 



•*(NJ 

•*o 
•* I 



I * 

UJ* 

o« 



o* 

t • 

u*k 



i ai 



1 \ 



ESTIMATEO DENSITY FUNCTION VS INTEGRAL SQUARE NCRM RANOOH VARIABLE 

INTEGRAL SQUAflE NGRN SAMPLE SIZE - 2000 TRIANGULAR WINOOW BANOWIOTH - RANGE 

ESTIMATEO DENSITY FUNCTION IS EVALUATEO AT 100 EQUALLY SPACEO POINTS 
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LN(l-I/N^n VS X EMPIRICAL LOG SURVIVAL FUNCTION 



FREQUENCIES SAMPLE SUE • 2000 
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